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5 <110> APPLICANT: Degussa AG 

8 <120> TITLE OF INVENTION: Process for the preparation of L- threonine 





11 


<130> FILE REFERENCE: 030217BT 




















c--> 


14 


<140> CURRENT APPLICATION NUMBER: US/10/566 , 606 












C--> 


14 


<141> CURRENT PILING DATE: 2006- 


-01-31 


















14 


<160> NUMBER OF 


SEQ 


ID NOS: 


10 






















17 


<170> SOFTWARE: 


Patentln version 3.1 


















20 


<210> SEQ ID NO 


: 1 


























21 


<211> LENGTH: 993 


























22 


<212> TYPE: DNA 




























23 


<213> ORGANISM: 


Escherichia 


coli 




















26 


<220> FEATURE: 




























27 


<221> NAME/KEY: 


CDS 


























28 


<222> LOCATION: 


(1) 


. . (990) 
























29 


<223> OTHER INFORMATION: rpoS gene 




















32 


<400> SEQUENCE: 


1 


























33 


atg agt cag aat 


acg 


ctg aaa 


gtt 


cat 


gat 


tta 


aat 


gaa 


gat 


gcg 


gaa 


48 




34 


Met Ser Gin Asn 


Thr 


Leu Lys 


Val 


His 


Asp 


Leu 


Asn 


Glu 


Asp 


Ala 


Glu 






35 


1 


5 








10 










15 








37 


ttt gat gag aac 


gga 


gtt gag 


gtt 


ttt 


gac 


gaa 


aag 


gcc 


tta 


gta 


gaa 


96 




38 


Phe Asp Glu Asn 


Gly 


Val Glu 


Val 


Phe 


Asp 


Glu 


Lys 


Ala 


Leu 


Val 


Glu 






39 


20 








25 










30 










41 


cag gaa ccc agt 


gat 


aac gat 


ttg 


gcc 


gaa 


gag 


gaa 


ctg 


tta 


teg 


cag 


144 




42 


Gin Glu Pro Ser 


Asp 


Asn Asp 


Leu 


Ala 


Glu 


Glu 


Glu 


Leu 


Leu 


Ser 


Gin 






43 


35 






40 










45 












45 


gga gcc aca cag 


cgt 


gtg ttg 


gac 


gcg 


act 


cag 


ctt 


tac 


ctt 


ggt 


gag 


192 




46 


Gly Ala Thr Gin 


Arg 


Val Leu 


Asp 


Ala 


Thr 


Gin 


Leu 


Tyr 


Leu 


Gly 


Glu 






47 


50 




55 










60 














49 


att ggt tat tea 


cca 


ctg tta 


acg 


gcc 


gaa 


gaa 


gaa 


gtt 


tat 


ttt 


gcg 


240 




50 


lie Gly Tyr Ser 


Pro 


Leu Leu 


Thr 


Ala 


Glu 


Glu 


Glu 


Val 


Tyr 


Phe 


Ala 






51 


65 




70 








75 










80 






53 


cgt cgc gca ctg 


cgt 


gga gat 


gtc 


gcc 


tct 


cgc 


cgc 


egg 


atg 


ate 


gag 


288 




54 


Arg Arg Ala Leu 


Arg 


Gly Asp 


Val 


Ala 


Ser 


Arg 


Arg 


Arg 


Met 


He 


Glu 






55 




85 








90 










95 








57 


agt aac ttg cgt 


ctg 


gtg gta 


aaa 


att 


gcc 


cgc 


cgt 


tat 


ggc 


aat 


cgt 


336 




58 


Ser Asn Leu Arg 


Leu 


Val Val 


Lys 


He 


Ala 


Arg 


Arg 


Tyr 


Gly 


Asn 


Arg 






59 


100 








105 










110 










62 


ggt ctg gcg ttg 


ctg 


gac ctt 


ate 


gaa 


gag 


ggc 


aac 


ctg 


ggg 


ctg 


ate 


384 




63 


Gly Leu Ala Leu 


Leu 


Asp Leu 


He 


Glu 


Glu 


Gly 


Asn 


Leu 


Gly 


Leu 


He 






64 


115 






120 










125 












66 


cgc gcg gta gag 


aag 


ttt gac 


ccg 


gaa 


cgt 


ggt 


ttc 


cgc 


ttc 


tea 


aca 


432 




67 


Arg Ala Val Glu 


Lys 


Phe Asp 


Pro 


Glu 


Arg 


Gly 


Phe 


Arg 


Phe 


Ser 


Thr 






68 


130 




135 










140 













file://C:\CRF4\Outhold\VsrJ566606.htm 



2/7/2006 



Page 2 of 7 



RAW SEQUENCE LISTING DATE: 02/07/2006 

PATENT APPLICATION: US/10/566,606 TIME: 09:51:17 



Input Set : A: \ SEQUENCE LISTING. txt 
Output Set: N:\CRF4\02012006\J566606.raw 



Id 
/ u 


t- =j r* 


gca 


acc 


tgg 


tgg 


_ 4_ i_ 

ate 


cgc 


cag 


acg 


att 


gaa 


egg 


gcg 


att 


atg 


aac 


480 


71 


lyr 


Hid 


Thr 


Trp 


Trp 


Tin 

lie 


Arg 


r*1 n 
uin 


inr 


lie 


vjlU 


Arg 


Ala 


lie 


Met 


Asn 




no 

1 A 


1 AC 










T E~ A 

lbU 










155 










16 0 




1 A 


caa 


acc 


cgt 


act 


act 


cgt 


ttg 


ccg 


att 


cac 


ate 


gta 


aag 


gag 


ctg 


aac 


528 


/ D 




i nr 


Arg 


Thr 


i le 


Arg 


Leu 


Pro 


lie 


HIS 


lie 


val 


Lys 


Glu 


Leu 


Asn 




/b 










165 










170 










175 






/o 


gtt 


tac 


ctg 


cga 


acc 


gca 


cgt 


gag 


ttg 


tec 


cat 


aag 


ctg 


gac 


cat 


gaa 


576 


*7 Q 


vai 


Tyr 


Leu Arg 


Thr 


Ala 


Arg 


Glu 


Leu 


Ser 


His 


Lys 


Leu 


Asp 


His 


Glu 




OvJ 








180 










185 










190 








82 


cca 


agt 


gcg 


gaa 


gag 


ate 


gca 


gag 


caa 


ctg 


gat 


aag 


cca 


gtt 


gat 


gac 


624 




Pro 


Ser 


Ala 


Glu 


Glu 


He 


TV T _ 

Ala 


Glu 


Gin 


Leu 


Asp 


Lys 


Pro 


Val 


Asp 


Asp 




Oft 






195 










O A A 

zUU 










205 










86 


gtc 


age 


cgt 


atg 


ctt 


cgt 


ctt 


aac 


gag 


cgc 


att 


acc 


teg 


gta 


gac 


acc 


672 


87 


Val 


Ser 


Arg Met 


Leu 


Arg 


Leu 


Asn 


Glu 


Arg 


He 


Thr 


Ser 


Val 


Asp 


Thr 




88 




210 










215 










220 












90 


ccg 


ctg 


ggt 


ggt 


gat 


tec 


gaa 


aaa 


gcg 


ttg 


ctg 


gac 


ate 


ctg 


gee 


gat 


720 


91 


Pro 


Leu 


Gly Gly 


Asp 


Ser 


Glu 


Lys 


Ala 


Leu 


Leu 


Asp 


He 


Leu 


Ala 


Asp 




92 


225 










230 










235 










240 




94 


gaa 


aaa 


gag 


aac 


ggt 


ccg 


gaa 


gat 


acc 


acg 


caa 


gat 


gac 


gat 


atg 


aag 


768 


95 


Glu 


Lys 


Glu 


Asn 


Gly 


Pro 


Glu 


Asp 


Thr 


Thr 


Gin 


Asp 


Asp 


Asp 


Met 


Lys 




96 










245 










250 










255 






98 


cag 


age 


ate 


gtc 


aaa 


tgg 


ctg 


ttc 


gag 


ctg 


aac 


gee 


aaa 


cag 


cgt 


gaa 


816 


99 


Gin 


Ser 


He 


Val 


Lys 


Trp 


Leu 


Phe 


Glu 


Leu 


Asn 


Ala 


Lys 


Gin 


Arg 


Glu 





100 






260 








265 






270 








102 


gtg 


ctg gca 


cgt cga 


ttc 


ggt 


ttg 


ctg ggg tac 


gaa 


gcg 


gca 


aca 


ctg 


864 


103 


Val 


Leu Ala 


Arg Arg 


Phe 


Gly Leu 


Leu Gly Tyr 


Glu 


Ala 


Ala 


Thr 


Leu 




104 




275 








280 






285 










106 


gaa 


gat gta 


ggt cgt 


gaa 


att 


ggc 


etc acc cgt 


gaa 


cgt 


gtt 


cgc 


cag 


912 


107 


Glu 


Asp Val 


Gly Arg 


Glu 


He 


Gly 


Leu Thr Arg 


Glu Arg Val Arg Gin 




108 




290 






295 






300 












110 


att 


cag gtt 


gaa ggc 


ctg 


cgc 


cgt 


ttg cgc gaa 


ate 


ctg 


caa 


acg 


cag 


960 


111 


He 


Gin Val 


Glu Gly 


Leu 


Arg 


Arg 


Leu Arg Glu 


He 


Leu 


Gin 


Thr 


Gin 




112 


305 






310 






315 










320 




114 


ggg 


ctg aat 


ate gaa 


gcg 


ctg 


ttc 


cgc gag taa 












993 


115 


Gly 


Leu Asn 


He Glu 


Ala 


Leu 


Phe 


Arg Glu 














116 






325 








330 















118 <210> SEQ ID NO: 2 

119 <211> LENGTH: 330 

120 <212> TYPE: PRT 

121 <213> ORGANISM: Escherichia coli 

124 <400> SEQUENCE: 2 

125 Met Ser Gin Asn Thr Leu Lys Val His Asp Leu Asn Glu Asp Ala Glu 

126 15 10 15 

12 8 Phe Asp Glu Asn Gly Val Glu Val Phe Asp Glu Lys Ala Leu Val Glu 
129 20 25 30 

131 Gin Glu Pro Ser Asp Asn Asp Leu Ala Glu Glu Glu Leu Leu Ser Gin 

132 35 40 45 

134 Gly Ala Thr Gin Arg Val Leu Asp Ala Thr Gin Leu Tyr Leu Gly Glu 

135 50 55 60 
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1^7 


Tl o n~] \r T'-t r-y- 

\j±y j.yr 


Ser 


Pro 


Leu 


Leu 


Thr 


Aia 


bill 


CjIU 


CjIU 


vai 


Tyr 


pne 


Aia 


-i. J o 








i fi 










/b 










O ft 

o U 




Aiy Airy Ala 


Leu 


Arg 


vjiy 


Asp 


vai 


Aia 


Ser 


Arg 


Arg 


Arg 


Met 


lie 


Glu 


j. *± j. 






O D 










q n 










ft c 

yb 




14 j 


Ser Asn Leu 


Arg 


T All 

Leu 


Val 


Val 


Lys 


He 


Ala 


Arg 


Arg 


Tyr 


Gly 


Asn 


Arg 






inn 
1UU 










T ft C 

105 










110 






1 AC 


biy Lieu Ala. 


Leu 


Leu 


ASp 


Leu 


lie 


Glu 


Glu 


Gly 


Asn 


Leu 


Gly 


Leu 


He 


lffc / 


lie 
lib 










t O ft 

lzO 










125 








TAG 


Arg Ala vai 


GlU 


Lys 


Phe 


Asp 


Pro 


Glu 


Arg 


Gly 


Phe 


Arg 


Phe 


Ser 


Thr 




IjU 








T O C 

lib 










140 










ICO 

lbz 


Tyr Ala Thr 


Trp 


Trp 


He 


Arg 


Gin 


Thr 


He 


Glu 


Arg 


Ala 


He 


Met 


Asn 










loU 










lbb 










160 


lbb 


Gin Thr Arg 


Thr 


He 


Arg 


Leu 


Pro 


He 


His 


He 


Val 


Lys 


Glu 


Leu 


Asn 


1 trzr 
1DD 






165 










170 










175 




1 ro 

lbo 


Val Tyr Leu 


Arg 


Thr 


Ala 


Arg 


Glu 


Leu 


Ser 


His 


Lys 


Leu 


Asp 


His 


Glu 


ICQ 




1 Oft 

180 










185 










190 






161 


Pro Ser Ala 


Glu 


Glu 


He 


Ala 


Glu 


Gin 


Leu 


Asp 


Lys 


Pro 


Val 


Asp 


Asp 


1 CO 


1 ft C 

iy5 










200 










205 








164 


Val Ser Arg 


Met 


Leu 


Arg 


Leu 


Asn 


Glu 


Arg 


He 


Thr 


Ser 


Val 


Asp 


Thr 


16b 


210 








215 










220 










167 


Pro Leu Gly 


Gly 


Asp 


Ser 


Glu 


Lys 


Ala 


Leu 


Leu 


Asp 


He 


Leu 


Ala 


Asp 


168 


225 






230 










235 










240 


170 


Glu Lys Glu 


Asn 


Gly 


Pro 


Glu 


Asp 


Thr 


Thr 


Gin 


Asp 


Asp 


Asp 


Met 


Lys 


1 /l 






245 










250 










255 




173 


Gin Ser lie 


Val 


Lys 


Trp 


Leu 


Phe 


Glu 


Leu 


Asn 


Ala 


Lys 


Gin 


Arg 


Glu 


174 




260 










265 










270 






176 


Val Leu Ala 


Arg 


Arg 


Phe 


Gly 


Leu 


Leu 


Gly 


Tyr 


Glu 


Ala 


Ala 


Thr 


Leu 


177 


275 










280 










285 








179 


Glu Asp Val 


Gly 


Arg 


Glu 


He 


Gly 


Leu 


Thr 


Arg 


Glu 


Arg 


Val 


Arg 


Gin 


180 


290 








295 










300 










182 


He Gin Val 


Glu 


Gly 


Leu 


Arg 


Arg 


Leu 


Arg 


Glu 


He 


Leu 


Gin 


Thr 


Gin 


183 


305 






310 










315 










320 


186 


Gly Leu Asn 


He 


Glu 


Ala 


Leu 


Phe 


Arg 


Glu 














187 






325 










330 














189 


<210> SEQ ID NO: 


: 3 

























190 <211> LENGTH: 993 

191 <212> TYPE: DNA 

192 <213> ORGANISM: Escherichia coli 

195 <220> FEATURE: 

196 <221> NAME/KEY: Allele 

197 <222> LOCATION: (1) . . (990) 

198 <223> OTHER INFORMATION: rpoS allele 

201 <220> FEATURE: 

202 <221> NAME/KEY: misc_feature 

203 <222> LOCATION: (97).. (99) 

204 <223> OTHER INFORMATION: amber codon 

207 <400> SEQUENCE: 3 

208 atgagtcaga ataegctgaa agttcatgat ttaaatgaag atgeggaatt tgatgagaac 60 
210 ggagttgagg tttttgacga aaaggectta gtagaatagg aacccagtga taacgatttg 120 
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212 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt 180 

214 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg 240 

216 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt 300 

218 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc 360 

22 0 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc 42 0 
222 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac 480 
224 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga 540 
226 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag 600 
228 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc 660 

23 0 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat 720 
232 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc 780 
234 aaatggctgt tcgagctgaa cgccaaacag cgtgaagtgc tggcacgtcg attcggtttg 840 
236 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa 900 
238 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag 960 
240 gggctgaata tcgaagcgct gttccgcgag taa 993 



242 <210> SEQ ID NO: 4 

243 <211> LENGTH: 75 

244 <212> TYPE: DNA 

245 <213> ORGANISM: Escherichia coli 

248 <220> FEATURE: 

249 <221> NAME /KEY : tRNA 

250 <222> LOCATION: (1) . . (75) 

251 <223> OTHER INFORMATION: supE allele 

254 <400> SEQUENCE: 4 

255 tggggtatcg ccaagcggta aggcaccgga ttctaattcc ggcattccga ggttcgaatc 60 
257 ctcgtacccc agcca 75 

259 <210> SEQ ID NO: 5 

260 <211> LENGTH: 1545 

261 <212> TYPE: DNA 

262 <213> ORGANISM: Escherichia coli 

265 <220> FEATURE: 

266 <221> NAME/KEY: CDS 

267 <222> LOCATION: (1) . . (1542) 

268 <223> OTHER INFORMATION: ilvA gene 
270 <400> SEQUENCE: 5 



271 


atg 


get 


gac 


teg 


caa 


ccc 


ctg 


tec 


ggt 


get 


ccg 


gaa 


ggt 


gee 


gaa 


tat 


48 


272 


Met 


Ala 


Asp 


Ser 


Gin 


Pro 


Leu 


Ser 


Gly Ala 


Pro Glu Gly Ala Glu Tyr 




273 


1 








5 










10 










15 






275 


tta 


aga 


gca 


gtg 


ctg 


cgc 


gcg 


ccg 


gtt 


tac 


gag 


gcg 


gcg 


cag 


gtt 


acg 


96 


276 


Leu 


Arg 


Ala 


Val 


Leu 


Arg Ala 


Pro 


Val 


Tyr 


Glu 


Ala 


Ala 


Gin 


Val 


Thr 




277 








20 










25 










30 








279 


ccg 


eta 


caa 


aaa 


atg 


gaa 


aaa 


ctg 


teg 


teg 


cgt 


ctt 


gat 


aac 


gtc 


att 


144 


280 


Pro 


Leu 


Gin 


Lys 


Met 


Glu 


Lys 


Leu 


Ser Ser Arg Leu Asp Asn Val 


He 




281 






35 










40 










45 










283 


ctg 


gtg 


aag 


cgc 


gaa 


gat 


cgc 


cag 


cca 


gtg 


cac 


age 


ttt 


aag 


ctg 


cgc 


192 


284 


Leu 


Val 


Lys 


Arg 


Glu 


Asp Arg 


Gin 


Pro 


Val 


His 


Ser 


Phe 


Lys 


Leu Arg 




285 




50 










55 










60 












287 


ggc 


gca 


tac 


gec 


atg 


atg 


gcg 


ggc 


ctg 


acg 


gaa 


gaa 


cag 


aaa 


gcg 


cac 


240 


288 


Gly 


Ala 


Tyr 


Ala 


Met 


Met 


Ala 


Gly 


Leu 


Thr 


Glu 


Glu 


Gin 


Lys 


Ala 


His 
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Val 
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ctg 
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aac 
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gcg 


aaa 


gec 
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gcg 


ate 


gaa 


432 


"2 n 


Leu 


Leu 


His Gly Ala 


Asn 


Phe 


Asp 


Glu 


Ala 


Lys 


Ala 


Lys 


Ala 


He 


Glu 




"3 A C 

305 




130 
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t n t 
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480 
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Trp 


Val 
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Pro 


Phe 
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Pro 
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309 
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150 
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160 




311 


atg 


gtg 


att gec ggg 


caa 


ggc 


acg 


ctg 


gcg 


ctg 


gaa 


ctg 


etc 
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cag 


528 
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Val 
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Thr 
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Leu 


Gin 


Gin 




J 13 
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cat etc gac 


cgc 


gta 


ttt 
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ggt 


ctg 


576 


Jib 
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Val 


Phe 


Val 
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Val 


Gly 


Gly 


Gly 


Gly Leu 




Tin 
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get 
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